Implementation of realtime STRAIGHT speech manipulation system: Report on its first implementation
نویسندگان
چکیده
A very high quality speech analysis, modification and synthesis system—STRAIGHT— has now been implemented in C language and operated in realtime. This article first provides a brief summary of STRAIGHT components and then introduces the underlying principles that enabled realtime operation. In STRAIGHT, the built-in extended pitch synchronous analysis, which does not require analysis window alignment, plays an important role in realtime implementation. A detailed description of the processing steps, which are based on the so-called ‘‘just-in-time’’ architecture, is presented. Further, discussions on other issues related to realtime implementation and performance measures are also provided. The software will be available to researchers upon request.
منابع مشابه
Design and Implementation of an Intelligent Part of Speech Generator
The aim of this paper is to report on an attempt to design and implement an intelligent system capable of generating the correct part of speech for a given sentence while the sentence is totally new to the system and not stored in any database available to the system. It follows the same steps a normal individual does to provide the correct parts of speech using a natural language processor. It...
متن کاملTANDEM-STRAIGHT, a research tool for L2 study enabling flexible manipulations of prosodic information
A speech analysis, modification, and resynthesis system called STRAIGHT has been widely used in the speech research community. However, its foundation and implementation were not well established. This lecture introduces recent advances in STRAIGHT’s foundation based on a new concept called TANDEM, a simple method for calculating temporally stable power spectra using two F0-adaptive time window...
متن کاملPersian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods
Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...
متن کاملA continuous VQ clustering algorithm for realtime speech recognition
This paper presents a continuous VQ clustering (CVQC) algorithm for realtime speech recognition, which incorporates the temporal information of speech into both training and recognition processes. In comparison with the conventional DTW and VQ methods, this new algorithm delivers faster training and recognition speed and smaller codebook size while still retains merits of both. Realtime impleme...
متن کاملImplementation of The First Medical science Olympiad in Iran: A report
The first national medical science Olympiad suggested by Isfahan University of Medical Sciences was hold in 2009 in Isfahan. The venture had the mission to identify and flourish potentials in Iranian medical science students - the health system's capital. The ministry of health in collaboration with the affiliated universities hosted 364 medical science students. Students formed teams of three ...
متن کامل